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SPEECH CODING APPARATUS AND SPEECH DECODING APPARATUS 

BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION: 
5 The present invention relates to a speech coding 

apparatus and speech decoding apparatus and, more 
particularly, to a speech coding apparatus for coding a 
speech signal at a low bit rate with high quality. 
DESCRIPTION OF THE PRIOR ART: 
10 As a conventional method of coding a speech signal 

with high efficiency, CELP (Code Excited Linear Predictive 
Coding) is known, which is disclosed, for example, in 
M. Schroeder and B. Atal, "Code-excited linear prediction: 
High quality speech at low bit rates", Proc. ICASSP, 1985, 
15 pp. 937-940 (reference 1) and Kleijn et al . , "Improved 
speech quality and efficient vector quantization in SELP", 
Proc. ICASSP, 1988, pp. 155-158 (reference 2). 

In this CELP coding scheme, on the transmission side, 
spectrum parameters representing a spectrum characteristic 
20 of a speech signal are extracted from the speech signal 
for each frame (for example, 20 ms) using linear 
predictive coding (LPC) analysis. Each frame is divided 
into subframes (for example, of 5 ms) , and for each 
subframe, parameters for an adaptive codebook (a delay 
25 parameter and a gain parameter corresponding to the pitch 
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period) are extracted based on the sound source signal in 
the past and then the speech signal of" the subframe is 
pitch predicted using the adaptive codebook. 

With respect to the sound source signal obtained by 
5 the pitch prediction, an optimum sound source code vector 
is selected from a sound source codebook (vector 
quantization codebook) consisting of predetermined types 
of noise signals, and an optimum gain is calculated to 
quantize the sound source signal* 

10 The selection of" a sound source code vector is 

performed so as to minimize the error power between a 
signal synthesized based on the selected noise signal and 
the residue signal* Then, an index and a gain 

representing the kind of the selected code vector as well 

15 as the spectrum parameter and the parameters of the 
adaptive codebook are combined and transmitted by a 
multiplexer section. A description of the operation of 
the reception side will be omitted. 

The conventional coding scheme described above is 

2 0 disadvantageous in that a large calculation amount is 
required to select an optimum sound source code vector 
from a sound source codebook . 

This arises from the fact that, in the methods in 
references 1 and 2, in order to select a sound source code 

25 vector, filtering or convolution calculation is performed 



once for each code vectors, and such calculation is 
repeated by a number of times equal to the number of code 
vectors stored in the codebook. 

Assume that the number of bits of the codebook is B 
5 and the order is N. In this case, if the filter or 
impulse response length in filtering or convolution 
calculation is K, the calculation amount required is N x K 
x 2B x 8000 per second. As an example, if B=10, N=40 and 
k=10, 81,920,000 calculations are required per second. In 

10 this manner, the conventional coding scheme is 
disadvantageous in that it requires a very large 
calculation size. 

Various methods which reduce the calculation amount 
required to search a sound source codebook have been 

15 proposed. One of the methods is an ACE LP {Algebraic Code 
Excited Linear Prediction) method, which is disclosed, for 
example , in C. Laflamme et al . , "16 kbps wideband speech 
coding technique based on algebraic CELP", Proc. ICASSP, 
1991, pp. 13-16 (reference 3). 

20 According to the method disclosed in reference 3, a 

sound source signal is represented by a plurality of 
pulses and transmitted while the positions of the 
respective pulses are represented by predetermined numbers 
of bits. In this case, since the amplitude of each pulse 

25 is limited to +1.0 or -1.0, the calculation amount 
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required to search pulses can be greatly reduced* 

As described above , according to the method disclosed 
in reference 3, a great reduction in calculation amount 
can be attained . 
5 Another problem is that at a bit rate less than 8 

kb/s, especially when background noise is superimposed on 
speech, the background noise portion of the coded speech 
greatly deteriorates in sound quality , although the sound 
quality is good at 8 kb/s or higher* 

10 Such a problem arises for the following reason* 

Since a sound source is represented by a combination of a 
plurality of pulses, pulses concentrate near a pitch pulse 
as the start point of a pitch in a vowel interval of 
speech. This signal can therefore be efficiently 

15 expressed by a small number of pulses. For a random 
signal like background noise, however, pulses must be 
randomly generated, and hence the background noise cannot 
be properly expressed by a small number of pulses. As a 
consequence, if the bit rate decreases, and the number of 

20 pulses decreases, the sound quality of background noise 
abruptly deteriorates. 

SUMMARY OF THE INVENTION 
The present invention has been made in consideration 
of the above situation in the prior art, and has as its 

2 5 object to provide a speech coding system which can solve 



X 



the above problems and suppress a deterioration in sound 
quality in terms of background noise, in particular , with 
a relatively small calculation amount. 

In order to achieve the above object, a speech coding 
5 apparatus according to the first aspect of the present 
invention including a spectrum parameter calculation 
section for receiving a speech signal, obtaining a 
spectrum parameter, and quantizing the spectrum parameter, 
an adaptive codebook section for obtaining a delay and a 

10 gain from a past quantized sound source signal by using an 
adaptive codebook, and obtaining a residue by predicting a 
speech signal, and a sound source quantization section for 
quantizing a sound source signal of the speech signal by 
using the spectrum parameter and outputting the sound 

15 source signal is characterized by comprising a 
discrimination section for discriminating a mode on the 
basis of a past quantized gain of an adaptive codebook, a 
sound source quantization section which has a codebook for 
representing a sound source signal by a combination of a 

2 0 plurality of non-zero pulses and collectively quantizing 
amplitudes or polarities of the pulses when an output from 
the discrimination section indicates a predetermined mode, 
and searches combinations of code vectors stored in the 
codebook and a plurality of shift amounts used to shift 

25 positions of the pulses so as to output a combination of a 



code vector and shift amount which minimizes distortion 
relative to input speech, and a multiplexer section for 
outputting a combination of an output from the spectrum 
parameter calculation section, an output from the adaptive 
5 codebook section, and an output from the sound source 
quantization section . 

A speech coding apparatus according to the second 
aspect of the present invention including a spectrum 
parameter calculation section for receiving a speech 

10 signal, obtaining a spectrum parameter, and quantizing the 
spectrum parameter, an adaptive codebook section for 
obtaining a delay and a gain from a past quantized sound 
source signal by using an adaptive codebook, and obtaining 
a residue by predicting a speech signal , and a sound 

15 source quantization section for quantizing a sound source 
signal of the speech signal by using the spectrum 
parameter and outputting the sound source signal, is 
characterized by comprising a discrimination section for 
discriminating a mode on the basis of a past quantized 

20 gain of an adaptive codebook, a sound source quantization 
section which has a codebook for representing a sound 
source signal by a combination of a plurality of non-zero 
pulses and collectively quantizing amplitudes or 
polarities of the pulses when an output from the 

25 discrimination section indicates a predetermined mode, and 



outputs a code vector that minimizes distortion relative 
to input speech by generating positions of the pulses 
according to a predetermined rule, and a multiplexer 
section for outputting a combination of an output from the 
spectrum parameter calculation section, an output from the 
adaptive codebook section, and an output from the sound 
source quantization section. 

A speech coding apparatus according to the third 
aspect of the present invention including a spectrum 
parameter calculation section for receiving a speech 
signal, obtaining a spectrum parameter, and quantizing the 
spectrum parameter, an adaptive codebook section for 
obtaining a delay and a gain from a past quantized sound 
source signal by using an adaptive codebook, and obtaining 
a residue by predicting a speech signal, and a sound 
source quantization section for quantizing a sound source 
signal of the speech signal by using the spectrum 
parameter and outputting the sound source signal is 
characterized by comprising a discrimination section for 
discriminating a mode on the basis of a past quantized 
gain of an adaptive codebook, a sound source quantization 
section which has a codebook for representing a sound 
source signal by a combination of a plurality of non-zero 
pulses and collectively quantizing amplitudes or 
polarities of the pulses when an output from the 
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discrimination section indicates a predetermined mode, and 
a gain codebook for quantizing gains, and searches 
combinations of code vectors stored in the codebook , a 
plurality of shift amounts used to shift positions of the 
5 pulses, and gain code vectors stored in the gain codebook 
so as to output a combination of a code vector, shift 
amount, and gain code vector which minimizes distortion 
relative to input speech, and a multiplexer section for 
outputting a combination of an output from the spectrum 

10 parameter calculation section, an output from the adaptive 
codebook section, and an output from the sound source 
quantization section . 

A speech coding apparatus according to the fourth 
aspect of the present invention including a spectrum 

15 parameter calculation section for receiving a speech 
signal, obtaining a spectrum parameter, and quantizing the 
spectrum parameter, an adaptive codebook section for 
obtaining a delay and a gain from a past quantized sound 
source signal by using an adaptive codebook, and obtaining 

20 a residue by predicting a speech signal, and a sound 
source quantization section for quantizing a sound source 
signal of the speech signal by using the spectrum 
parameter and outputting the sound source signal is 
characterized by comprising a discrimination section for 

25 discriminating a mode on the basis of a past quantized 



gain of an adaptive codebook, a sound source quantization 
section which has a codebook for representing a sound 
source signal by a combination of a plurality of non-zero 
pulses and collectively quantizing amplitudes or 
5 polarities of the pulses when an output from the 
discrimination section indicates a predetermined mode, and 
a gain codebook for quantizing gains, and outputs a 
combination of a code vector and gain code vector which 
minimizes distortion relative to input speech by 

10 generating positions of the pulses according to a 
predetermined rule, and a multiplexer section for 
outputting a combination of an output from the spectrum 
parameter calculation section, an output from the adaptive 
codebook section, and an output from the sound source 

15 quantization section. 

A speech decoding apparatus according to the fifth 
aspect of the present invention is characterized by 
comprising a demultiplexer section for receiving and 
demultiplexing a spectrum parameter, a delay of an 

2 0 adaptive codebook, a quantized gain, and quantized sound 
source information, a mode discrimination section for 
discriminating a mode by using a past quantized gain in 
the adaptive codebook, and a sound source signal 
reconstructing section for reconstructing a sound source 

25 signal by generating non-zero pulses from the quantized 



sound source information when an output from the 
discrimination section indicates a predetermined mode, 
wherein a speech signal is reproduced by passing the sound 
source signal through a synthesis filter section 
constituted by spectrum parameters. 

As is obvious from the above aspects, according to 
the present invention, the mode is discriminated on the 
basis of the past quantized gain of the adaptive codebook . 
If a predetermined mode is discriminated, combinations of 
code vectors stored in the codebook, which is used to 
collectively quantize the amplitudes or polarities of a 
plurality of pulses, and a plurality of shift amounts used 
to temporally shift predetermined pulse positions are 
searched to select a combination of a code vector and 
shift amount which minimizes distortion relative to input 
speech. With this arrangement, even if the bit rate is 
low, a background noise portion can be properly coded with 
a relatively small amount calculation amount. 

In addition, according to the present invention, a 
combination of a code vector, shift amount, and gain code 
vector which minimizes distortion relative to input speech 
is selected by searching combinations of code vectors, a 
plurality of shift amounts, and gain code vectors stored 
in the gain codebook for quantizing gains. With this 
operation, even if speech on which background noise is 
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superimposed is coded at a low bit rate, a background 
noise portion can be properly coded. 

The above and many other objects, features and 
advantages of the present invention will become manifest 
5 to those skilled in the art upon making reference to the 
following detailed description and accompanying drawings 
in which preferred embodiments incorporating the 
principles of the present invention are shown by way of 
illustrative examples . 
10 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram showing the schematic 
arrangement of the first embodiment of the present 
invention; 

Fig. 2 is a block diagram showing the schematic 
15 arrangement of the second embodiment of the present 
invention; 

Fig. 3 is a block diagram showing the schematic 
arrangement of the third embodiment of the present 
invention; 

20 Fig. 4 is a block diagram showing the schematic 

arrangement of the fourth embodiment of the present 
invention; and 

Fig. 5 is a block diagram showing the schematic 
arrangement of the fifth embodiment of the present 

25 invention. 



1 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Several embodiments of the present invention will be 
described below with reference to the accompanying 
drawings . In a speech coding apparatus according to an 
5 embodiment of the present invention, a mode discrimination 
circuit (370 in Fig. 1) discriminates the mode on the 
basis of the past quantized gain of an adaptive codebook. 
When a predetermined mode is discriminated, a sound source 
quantization circuit (350 in Fig. 1) searches combinations 

10 of code vectors stored in a codebook (351 or 352 in 
Fig. 1) , which is used to collectively quantize the 
amplitudes or polarities of a plurality of pulses , and a 
plurality of shift amounts used to temporally shift 
predetermined pulse positions, to select a combination of 

15 a code vector and shift amount which minimizes distortion 
relative to input speech. A gain quantization circuit 
(365 in Fig. 1) quantizes gains by using a gain codebook 
(380 in Fig. 1) . 

According to a preferred embodiment of the present 

2 0 invention , a speech decoding apparatus includes a 
demultiplexer section (510 in Fig. 5) for receiving and 
demultiplexing a spectrum parameter, a delay of an 
adaptive codebook, a quantized gain, and quantized sound 
source information, a mode discrimination section (530 in 

25 Fig. 5) for discriminating the mode on the basis of the 



past quantized gain of the adaptive codebook, and a sound 
source decoding section (540 in Fig. 5) for reconstructing 
a sound source signal by generating non-zero pulses from 
the quantized sound source information, A speech signal 
5 is reproduced or resynthesized by passing the sound source 
signal through a synthesis filter (560 in Fig. 5) defined 
by spectrum parameters . 

According to a preferred embodiment of the present 
invention, a speech coding apparatus according to the 

10 first aspect of the present invention includes a spectrum 
parameter calculation section for receiving a speech 
signal, obtaining a spectrum parameter, and quantizing the 
spectrum parameter, an adaptive codebook section for 
obtaining a delay and a gain from a past quantized sound 

15 source signal by using an adaptive codebook, and obtaining 
a residue by predicting a speech signal, and a sound 
source quantization section for quantizing a sound source 
signal of the speech signal by using the spectrum 
parameter and outputting the sound source signal is 

20 characterized by comprising a discrimination section or 
discriminating a mode on the basis of a past quantized 
gain of an adaptive codebook, a sound source quantization 
section which has a codebook for representing a sound 
source signal by a combination of a plurality of non-zero 

25 pulses and collectively quantizing amplitudes or 
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polarities of the pulses when an output from the 
discrimination section indicates a predetermined mode, and 
searches combinations of code vectors stored in the 
codebook and a plurality of shift amounts used to shift 
5 positions of the pulses so as to output a combination of a 
code vector and shift amount which minimizes distortion 
relative to input speech, and a multiplexer section for 
outputting a combination of an output from the spectrum 
parameter calculation section, an output from the adaptive 

10 codebook section, an output from the sound source 
quantization section, a demultiplexer section for 
receiving and demultiplexing a spectrum parameter, a delay 
of an adaptive codebook, a quantized gain, and quantized 
sound source information, a mode discrimination section 

15 for discriminating a mode by using a past quantized gain 
in the adaptive codebook, and a sound source signal 
reconstructing section for reconstructing a sound source 
signal by generating non-zero pulses from the quantized 
sound source information when an output from the 

2 0 discrimination section indicates a predetermined mode, A 
speech signal is reproduced by passing the sound source 
signal through a synthesis filter section constituted by 
spectrum parameters ♦ 

A speech coding apparatus according to the present 

25 invention includes a spectrum parameter calculation 
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section for receiving a speech signal, obtaining a 
spectrum parameter, and quantizing the spectrum parameter, 
an adaptive codebook section for obtaining a delay and a 
gain from a past quantized sound source signal by using an 
5 adaptive codebook, and obtaining a residue by predicting a 
speech signal, and a sound source quantization section for 
quantizing a sound source signal of the speech signal by 
using the spectrum parameter and outputting the sound 
source signal, is characterized by comprising a 

10 discrimination section for discriminating a mode on the 
basis of a past quantized gain of an adaptive codebook, a 
sound source quantization section which has a codebook for 
representing a sound source signal by a combination of a 
plurality of non-zero pulses and collectively quantizing 

15 amplitudes or polarities of the pulses when an output from 
the discrimination section indicates a predetermined mode, 
and outputs a code vector that minimizes distortion 
relative to input speech by generating positions of the 
pulses according to a predetermined rule, and a 

2 0 multiplexer section for outputting a combination of an 
output from the spectrum parameter calculation section, an 
output from the adaptive codebook section, an output from 
the sound source quantization section, a demultiplexer 
section for receiving and demultiplexing a spectrum 

2 5 parameter, a delay of an adaptive codebook, a quantized 



- 16 - 



gain, and quantized sound source information, a mode 
discrimination section for discriminating a mode by using 
a past quantized gain in the adaptive codebook, and a 
sound source signal reconstructing section for 
5 reconstructing a sound source signal by generating pulse 
positions according to a predetermined rule and generating 
amplitudes or polarities for the pulses from a code vector 
to generate a sound source signal when the output from the 
discrimination section indicates a predetermined mode. A 

10 speech signal is reproduced by passing the sound source 
signal through a synthesis filter section constituted by 
spectrum parameters . 
First Embodiment : 

Fig. 1 is a block diagram showing the arrangement of 

15 a speech coding apparatus according to an embodiment of 
the present invention. 

Referring to Fig. 1, when a speech signal is input 
through an input terminal 100, a frame division circuit 
110 divides the speech signal into frames (for example, of 

2 0 20 ms) . A subframe division circuit 120 divides the 
speech signal of each frame into subf rames (for example, 
of 5 ms) shorter than the frames. 

A spectrum parameter calculation circuit 200 extracts 
speech from the speech signal of at least one subframe 

2 5 using a window (for example, of 24 ms) longer than the 
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subframe length and calculates spectrum parameters by 
computations of a predetermined order (for example, P - 
10) . In this case, for the calculation of spectrum 
parameters, an LPC analysis, a Burg analysis, and the like 
5 which are well known in the art can be used. In this case, 
the Burg analysis is used. Since the Burg analysis is 
disclosed in detail in Nakamizo, "Signal Analysis and 
System Identification", Corona, 1988, pp. 82 - 87 
(reference 4) , a description thereof will be omitted. 

10 In addition, a spectrum parameter calculation circuit 

210 transforms linear predictive coefficients a il 
(i— 1,..., 10) calculated using the Burg method into LSP 
parameters suitable for quantization and interpolation. 
Such transformation from linear predictive coefficients 

15 into LSP parameters is disclosed in Sugamura et al * , 
"Speech Data Compression by LSP Speech Analysis-Synthesis 
Technique", Journal of the Electronic Communications 
Society of Japan, J64-A, 1981, pp. 599-606 (reference 5) . 

For example, linear predictive coefficients 

2 0 calculated for the second and fourth subframes based on 
the Burg method are transformed into LSP parameters 
whereas LSP parameters of the first and third subframes 
are determined by linear interpolation, and the LSP 
parameters of the first and third subframes are inversely 

25 transformed into linear predictive coefficients. Then, 
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the linear predictive coefficients a il (1=1,..., 10, 

1=1,..., 5) of the first to fourth subframes are output to 

a perceptual weighting circuit 230. The LSP parameters of 

the fourth subf rame are output to the spectrum parameter 

5 quantization circuit 210. 

The spectrum parameter quantization circuit 210 

efficiently quantizes the LSP parameters of a 

predetermined subf rame from the spectrum parameters and 

outputs a quantization value which minimizes the 

10 distortion given by: 

= J W(i)[LSP(i) - QLSP(i) 3 ] 2 ... (1) 

1=1 

where LSP(i), QLSP(i) j/ and W(i) are the LSP parameter of 
the ith-order before quantization, the jth result after 
the quantization, and the weighting coefficient, 

15 respectively. 

In the following description, it is assumed that 
vector quantization is used as a quantization method, and 
LSP parameters of the fourth subf rame are quantized. 
Any known technique can be employed as the technique for 

2 0 vector quantization of LSP parameters. More specifically, 
a technique disclosed in, for example, Japanese Unexamined 
Patent Publication No. 4-171500 (Japanese Patent 
Application No. 2-297600) (reference 6) , Japanese 
Unexamined Patent Publication No. 4-363000 (Japanese 
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Patent Application No. 3-261925) (reference 7) , Japanese 
Unexamined Patent Publication No. 5-6199 (Japanese Patent 
Application No. 3-155049) (reference 8) , T. Nomura et al . , 
"LSP Coding VQ-SVQ with Interpolation in 4.075 kbps M- 
5 LCELP Speech Coder", Proc. Mobile Multimedia 
Communications, 1993, pp. B.2.5 (reference 9) or the like 
can be used. Accordingly, a description of details of the 
technique is omitted herein. 

The spectrum parameter quantization circuit 210 

10 reconstructs the LSP parameters of the first to fourth 
subframes based on the LSP parameters quantized with the 
fourth subframe. Here, linear interpolation of the 

quantization LSP parameters of the fourth subframe of the 
current frame and the quantization LSP parameters of the 

15 fourth subframe of the immediately preceding frame is 
performed to reconstruct LSP parameters of the first to 
third subframes . 

In this case, after a code vector which minimizes the 
error power between the LSP parameters before quantization 

2 0 and the LSP parameters after quantization is selected, the 
LSP parameters of the first to fourth subframes are 
reconstructed by linear interpolation. In order to 

further improve the performance, after a plurality of 
candidates are first selected as a code vector which 

2 5 minimizes the error power, the accumulated distortion may 
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be evaluated with regard to each of the candidates to 
select a set of a candidate and an interpolation LSP 
parameter which exhibit a minimum accumulated distortion. 
The details of this technique are disclosed, for example, 
5 in Japanese Patent Application No. 5-8737 (reference 10) . 

The LSP parameters of the first to third subframes 
reconstructed in such a manner as described above and the 
quantization LSP parameters of the fourth subframe are 
transformed into linear predictive coefficients a il 

10 (i=l , . . . , 10, 1=1,..., 5) for each subframe, and the 
linear predictive coefficients are output to the impulse 
response calculation circuit 310. Furthermore, an index 
representing the code vector of the quantization LSP 
parameters of the fourth subframe is output to a 

15 multiplexer 4 00. 

The perceptual weighting circuit 230 receives the 
linear predictive coefficients a il (i=l, . . . , 10, 1=1, . . . , 
5) before quantization for each subframe from the spectrum 
parameter calculation circuit 200, performs perceptual 

2 0 weighting for the speech signal of the subframe on the 
basis of the method described in reference 1 and outputs a 
resultant perceptual weighting signal. 

A response signal calculation circuit 240 receives 
the linear predictive coefficients a il for each subframe 

25 from the spectrum parameter calculation circuit 200, 
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receives the linear predictive coefficients a il 
reconstructed by quantization and interpolation for each 
subframe from the spectrum parameter quantization circuit 
210, calculates, for one subframe, a response signal with 
5 which the input signal is reduced to zero d(n)=0 using a 
value stored in an interval filter memory, and outputs the 
response signal to a subtracter 235. In this case, the 
response signal x z (n) is represented by: 

10 10 10 

x 2 (n) = d(n) - 2 a i d < n ~ i} Z a^Yto - i) + £ a iYX( n " ^ 
i= i i=i i=i 

10 •••< 2 ) 
If n - i < 0, then 

y(n i) P(N + (n 

i)) ...(3) 

x 2 (n - i) = s v (N + (n 

15 i)) ..-(4) 

where N is the subframe length, y is the weighting 
coefficient for controlling the perceptual weighting 
amount and has a value equal to the value of equation (7) 
given below, and s w (n) and p(n) are an output signal of a 
2 0 weighting signal calculation circuit 360 and an output 
signal of the term of the denominator of a filter 
described by the first term of the right side of equation 
(7) , respectively* 

The subtracter 235 subtracts response signals x2 (n) 
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corresponding to one subframe from the perceptual 
weighting signal x w (n) by: 

x^n) = x w (n) - x x (n) . . • (5) 

and outputs a signal x' w (n) to an adaptive codebook 
circuit 500. 

The impulse response calculation circuit 310 
calculates only a predetermined number L of impulse 
responses h w (n) of a perceptual weighting filter H(z) 
whose z- transform (transfer function) is represented by: 



10 



1 - 2 a.Z" 1 1 

10 H W (Z) = ro ^ To ••• (6) 

l-Xayz-'i-joyz- 1 

1=1 1=1 

and outputs them to the adaptive codebook circuit 500 and 
a sound source quantization circuit 350. 

The adaptive codebook circuit 500 receives a sound 
source signal v(n) in the past from a gain quantization 

15 circuit 366, receives the output signal x T w (n) from the 
subtracter 235 and the impulse responses h w (n) from the 
impulse response calculation circuit 310. Then, the 
adaptive codebook circuit 500 calculates a delay DT 
corresponding to the pitch, which minimizes the distortion 

2 0 given by: 



N-l 



D T = Z x w (n) 



n = 0 



N-l 



2 x>)y w (n - T) 



n = 0 



2 

/ 



N-l 



(7) 



for y w (n - T) = v(n - T)*h w (n) ...(8) 
and outputs an index representing the delay to the 



multiplexer 400. 

where the symbol * signifies a convolution calculation. 
A gain jS is obtained by: 

P = S *>)y w (n - T) / y*(n - T) . . . ( 9 ) 

n=0 n=0 

In this case, in order to improve the extraction 
accuracy of a delay for the voice of a woman or a child, 
the delay may be calculated not as an integer sample value 
but a decimal fraction sample value. A detailed method is 
disclosed, for example, in P. Kroon et. al . , "Pitch 
predictors with high terminal resolution", Proc. ICASSP, 
1990, pp. 661-664 (reference 11). 

In addition, the adaptive codebook circuit 500 
performs pitch prediction: 

e w (n) = x^(n) - pv(n - T) * h w (n) . . . (10) 

and outputs a resultant predictive residue signal e w (n) to 
the sound source quantization circuit 350. 

A mode discrimination circuit 370 receives the 
adaptive codebook gain j8 quantized by the gain 
quantization circuit 366 one subframe ahead of the current 
subframe, and compares it with a predetermined threshold 
Th to perform voiced/unvoiced determination. More 
specifically, if j8 is larger than the threshold Th, a 
voiced sound is determined. If jS is smaller than the 
threshold Th, an unvoiced sound is determined. The mode 
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discrimination circuit 370 then outputs a voiced/unvoiced 
discrimination information to the sound source 
quantization circuit 350, the gain quantization circuit 
366, and the weighting signal calculation circuit 360. 
5 The sound source quantization circuit 350 receives 

the voiced/unvoiced discrimination information and 
switches pulses depending on whether a voiced or an 
unvoiced sound is determined. 

Assume that M pulses are generated for a voiced sound. 

10 For a voiced sound, a B-bit amplitude codebook or 

polarity codebook is used to collectively quantize the 
amplitudes of pulses in units of M pulses. A case wherein 
the polarity codebook is used will be described below. 
This polarity codebook is stored in a codebook 351 for a 

15 voiced sound, and is store din a codebook 352 for an 
unvoiced sound. 

For a voiced sound, the sound source quantization 
circuit 350 reads out polarity code vectors from the 
codebook 351, assigns positions to the respective code 

2 0 vectors, and selects a combination of a code vector and a 
position which minimizes the distortion given by: 



n = 0 



i = l 



(ID 



where h w (n) is the perceptual weighting impulse response. 

Equation (11) can be minimized by obtaining a 
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combination of an amplitude code vector k and a position 
mi which maximizes D (kfi) given by: 



n = 0 



/ 2 S wkK> 



(12) 



n = 0 



where s wk (mi) is calculated according to equation (5) above. 
Alternatively, a combination which maximizes D (]C/i) : 



D, 



2 (|>(n)v k (n) 



/ Z s wkK) 



. . . (13) 



n = 0 



N-1 



for (f)(n) = Z e w(i)h w (i — n) / n = 0,...,N — 1 
. - . (14) 

may be selected. The calculation amount required for the 
10 numerator is smaller in this operation than in the above 
operation . 

In this case, to reduce the calculation amount, the 
positions that the respective pulses can assume for a 
voiced sound can be limited as in reference 3. If, for 
15 example, N = 4 0 and M = 5, the possible positions of the 
respective pulses are given by Table 1. 

Table 1 



o, 


5, 


10, 


15, 


20, 


25, 


30, 


35 


1, 


6, 


11/ 


16, 


21, 


26, 


31, 


36 


2, 


6, 


12, 


17, 


22, 


27, 


32, 


37 


3, 


8, 


13, 


18, 


23, 


28, 


33, 


38 


4, 


9, 


14, 


19/ 


24, 


29, 


34, 


39 



An index representing a code vector is then output to 
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10 



15 



20 



the multiplexer 400. 

Furthermore, a pulse position is quantized with a 
predetermined number of bits, and an index representing 
the position is output to the multiplexer 400. 

For unvoiced periods, as indicated by Table 2, pulse 
positions are set at predetermined intervals, and shift 
amounts for shifting the positions of all pulses are 
determined in advance . In the following case, the pulse 
positions are shifted in units of samples, and fourth 
types of shift amounts (shift 0, shift 1, shift 2, and 
shift 3) can be used. In this case, the shift amounts are 
quantized with two bits and transmitted. 

Table 2 



Pulse Position 



0, 4, 8, 12, 16, 20, 24, 28, 



The sound source quantization circuit 350 further 
receives polarity code vectors from the polarity codebook 
(sound source codebook) 352, and searches combinations of 
all shift amounts and all code vectors to select a 
combination of a shift amount 8 (j) and a code vector gk 
which minimizes the distortion given by: 



M-l 



n=0 



e„(n) - X Si* h w(n - m 1 - 5(j) 



1=1 



(15) 



An index representing the selected code vector and a 
code representing the selected shift amount are sent to 
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the multiplexer 400. 

Note that a codebook for quantizing the amplitudes of 
a plurality of pulses can be learnt in advance by using 
speech signals and stored* A learning method for the 
5 codebook is disclosed, for example, in "An algorithm for 
vector quantization design", IEEE Trans. Commun., January 
1980, pp. 84-95) (reference 12). 

The information of amplitudes and positions of voiced 
and unvoiced periods are output to the gain quantization 
10 circuit 366. 

The gain quantization circuit 366 receives the 
amplitude and position information from the sound source 
quantization circuit 350, and receives the voiced/unvoiced 
discrimination information from the mode discrimination 
15 circuit 370. 

The gain quantization circuit 366 reads out gain code 
vectors from a gain codebook 380 and selects one gain code 
vector that minimizes equation (16) below for the selected 
amplitude code vector or polarity code vector and the 
20 position. Assume that both the gain of the adaptive 
codebook and the sound source gain represented by a pulse 
are vector quantized simultaneously. 

When the discrimination information indicates a 
voiced sound, a gain code vector is obtained to minimize 
2 5 D k given by: 
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x w (n) - p>(n - T) * h w (n) - gl k h w (n - 

i = l 

. . . (16) 

where & k and Gk are kth code vectors in a two-dimensional 
gain codebook stored in the gain codebook 380. An index 
5 representing the selected gain code vector is output to 
the multiplexer 400. 

If" the discrimination information indicates an 
unvoiced sound, a gain code vector is searched out which 
minimizes D k given by: 

m -f 
x w (n) - fcv(n - T) * h w (n) - g[ k hjn - m, - S(j) ) 

i = l 

. • - (17) 

An index representing the selected gain code vector 
is output to the multiplexer 400. 

The weighting signal calculation circuit 360 receives 
15 the voiced/unvoiced discrimination information and the 
respective indices and reads out the corresponding code 
vectors according to the indices. For a voiced sound, the 
driving sound source signal v(n) is calculated by: 

M 

v(n) = P>(n - T) + G^ 9[ k 8(n - mj ... (18) 

2 0 This driving sound source signal v(n) is output to 

the adaptive codebook circuit 500. 

For an unvoiced sound, the driving sound 1 source 
signal v(n) is calculated by: 



n = 0 



10 



N-1 

= z 



M 

v(n) = (3>(n - T) + G[J] - m L - 5(i) ) . . . (19) 

1=1 

This driving sound source signal v(n) is output to 
the adaptive codebook circuit 500. 

Subsequently, the response signals s w (n) are 
calculated in units of subframes by using the output 
parameters from the spectrum parameter calculation circuit 
200 and spectrum parameter calculation circuit 210 using 

10 10 io 

s w (n) = v(n) - 2 ^vtn - i) + ]T a^pfn - i) + aj'sjn - i) 
1=1 1=1 1=1 

. . . (20) 

and are output to the response signal calculation circuit 
240. 

Second Embodiment 

Fig. 2 is a block diagram showing the schematic 
arrangement of the second embodiment of the present 
invention . 

Referring to Fig. 2, the second embodiment of the 
present invention differs from the above embodiment in the 
operation of a sound source quantization circuit 355. 
More specifically, when voiced/unvoiced discrimination 
information indicates an unvoiced sound, the positions 
that are generated in advance in accordance with a 
predetermined rule are used as pulse positions. 

For example, a random number generating circuit 600 
is used to generate a predetermined number of (e.g., Ml) 
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pulse positions. That is, the Ml values generated by the 
random number generating circuit 600 are used as pulse 
positions. The Ml positions generated in this manner are 
output to the sound source quantization circuit 355. 
5 If the discrimination information indicates a voiced 

sound, the sound source quantization circuit 355 operates 
in the same manner as the sound source quantization 
circuit 350 in Fig. 1. If the information indicates an 
unvoiced sound, the amplitudes or polarities of pulses are 

10 collectively quantized by using a sound source codebook 
352 in correspondence with the positions output from the 
random number generating circuit 600. 
Third Embodiment 

Fig. 3 is a block diagram showing the arrangement of 

15 the third embodiment of the present invention. 

Referring to Fig. 3, in the third embodiment of the 
present invention, when voiced/un voiced discrimination 
information indicates an unvoiced sound, a sound source 
quantization circuit 356 calculates the distortions given 

2 0 by equations (21) below in correspondence with all the 
combinations of all the code vectors in a sound source 
codebook 352 and the shift amounts of pulse positions, 
selects a plurality of combinations in the order which 
minimizes the distortions given by: 
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N-l 



^ = z 



e w( n ) - Z 9i* h w( n - - 5(j) 



n2 

. - . (21) 



and outputs them to a gain quantization circuit 366. 

The gain quantization circuit 366 quantizes gains for 
a plurality of sets of outputs from the sound source 
quantization circuit 356 by using a gain codebook 380, and 
selects a combination of a shift amount, sound source code 
vector, and gain code vector which minimizes distortions 
given by: 



n = 0 



x w (n) - p; V (n - T) * h w (n) - g' lk h w (n - m, - 5(j) ) 



i=l 



10 ... (22) 

Fourth Embodiment 

Fig. 4 is a block diagram showing the arrangement of 
the fourth embodiment of the present invention. 

Referring to Fig. 4, in the fourth embodiment of the 

15 present invention, when voiced/unvoiced discrimination 
information indicates an unvoiced sound, a sound source 
quantization circuit 357 collectively quantizes the 
amplitudes or polarities of pulses for the pulse positions 
generated by a random number generating circuit 600 by 

20 using a sound source codebook 352, and outputs all the 
code vectors or a plurality of code vector candidates to a 
gain quantization circuit 367. 

The gain quantization circuit 367 quantizes gains for 
the respective candidates output from the sound source 
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quantization circuit 357 by using a gain codebook 380, and 

outputs a combination of a code vector and gain code 

vector which minimizes distortion. 

Fifth Embodiment 
5 Fig. 5 is a block diagram showing the arrangement of 

the fifth embodiment of the present invention. 

Referring to Fig. 15, in the fifth embodiment of the 

present invention, a demultiplexer section 510 

demultiplexes a code sequence input through an input 
10 terminal 500 into a spectrum parameter, an adaptive 

codebook delay, an adaptive codebook vector, a sound 

source gain, an amplitude or polarity code vector as sound 

source information, and a code representing a pulse 

position, and outputs them. 
15 The demultiplexer section 510 decodes the adaptive 

codebook and sound source gains by using a gain codebook 

380 and outputs them. 

An adaptive codebook circuit 520 decodes the delay 

and adaptive codebook vector gains and generates an 
2 0 adaptive codebook recons tructi on s ignal by us i ng a 

synthesis filter input signal in a past subframe. 

A mode discrimination circuit 530 compares the 

adaptive codebook gain decoded in the past subframe with a 

predetermined threshold to discriminate whether the 
25 current subframe is voiced or unvoiced, and outputs the 
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voiced/unvoiced discrimination information to a sound 
source signal reconstructing circuit 540. 

The sound source signal reconstructing circuit 54 0 
receives the voiced/unvoiced discrimination information. 
If the information indicates a voiced sound, the sound 
source signal reconstructing circuit 540 decodes the pulse 
positions, and reads out code vectors from a sound source 
codebook 351. The circuit 540 then assigns amplitudes or 
polarities to the vectors to generate a predetermined 
number of pulses per subframe, thereby reclaiming a sound 
source signal . 

When the voiced/unvoiced discrimination information 
indicates an unvoiced sound, the sound source signal 
reconstructing circuit 540 reconstructs pulses from 
predetermined pulse positions, shift amounts, and 
amplitude or polarity code vectors. 

A spectrum parameter decoding circuit 570 decodes a 
spectrum parameter and outputs the resultant data to a 
synthesis filter 560 

An adder 550 adds the adaptive codebook output signal 
and the output signal from the sound source signal 
reconstructing circuit 540 and outputs the resultant 
signal to the synthesis filter 560. 

The synthesis filter 560 receives the output from the 
adder 550, reproduces speech, and outputs it from a 



terminal 580 . 
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WHAT IS CLAIMED IS: ^ 
1. A speech coding apparatus including at least 
a spectrum parameter calculation section for 

receiving a speech signal, obtaining a spectrum parameter, 

and quantizing the spectrum parameter, 

an adaptive codebook section for obtaining a delay 

and a gain from a past quantized sound source signal by 

using an adaptive codebook, and obtaining a residue by 

predicting a speech signal , and 

a sound source quantization section for quantizing a 

sound source signal of the speech signal by using the 

spectrum parameter and outputting the sound source signal , 

comprising: 

a discrimination section for discriminating a mode on 
the basis of a past quantized gain of an adaptive 
codebook ; 

a sound source quantization section which has a 
codebook for representing a sound source signal by a 
combination of a plurality of non-zero pulses and 
collectively quantizing amplitudes or polarities of the 
pulses when an output from said discrimination section 
indicates a predetermined mode, and searches combinations 
of code vectors stored in said codebook and a plurality of 
shift amounts used to shift positions of the pulses so as 
to output a combination of a code vector and shift amount 
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which minimizes distortion relative to input speech; and 

a multiplexer section for outputting a combination of 
an output from said spectrum parameter calculation section / 
an output from said adaptive codebook section, and an 
5 output from said sound source quantization section. 



2. A speech coding apparatus including at least 




a spectrum parameter calculation section for 

receiving a speech signal, obtaining a spectrum parameter, 

and quantizing the spectrum parameter, 
10 an adaptive codebook section for obtaining a delay 

and a gain from a past quantized sound source signal by 

using an adaptive codebook, and obtaining a residue by 

predicting a speech signal, and 

a sound source quantization section for quantizing a 
15 sound source signal of the speech signal by using the 

spectrum parameter and outputting the sound source signal , 

comprising: 

a discrimination section for discriminating a mode on 
the basis of a past quantized gain of an adaptive 
2 0 codebook ; 

a sound source quantization section which has a 
codebook for representing a sound source signal by a 
combination of a plurality of non-zero pulses and 
collectively quantizing amplitudes or polarities of the 
2 5 pulses when an output from said discrimination section 



indicates a predetermined mode, and outputs a code vector 
that minimizes distortion relative to input speech by 
generating positions of the pulses according to a 
predetermined rule; and 

a multiplexer section for outputting a combination of 
an output from said spectrum parameter calculation section, 
an output from said adaptive codebook section, and an 
output from said sound source quantization section. 



3. A speech coding apparatus including at least 

a spectrum parameter calculation section for 
receiving a speech signal, obtaining a spectrum parameter, 
and quantizing the spectrum parameter, 

an adaptive codebook section for obtaining a delay 
and a gain from a past quantized sound source signal by 
using an adaptive codebook, and obtaining a residue by 
predicting a speech signal, and 

a sound source quantization section for quantizing a 
sound source signal of the speech signal by using the 
spectrum parameter and outputting the sound source signal, 
comprising: 

a discrimination section for discriminating a mode on 
the basis of a past quantized gain of an adaptive 
codebook ; 

a sound source quantization section which has a 
codebook for representing a sound source signal by a 




combination of a plurality of non-zero pulses and 
collectively quantizing amplitudes or polarities of the 
pulses when an output from said discrimination section 
indicates a predetermined mode, and a gain codebook for 
quantizing gains, and searches combinations of code 
vectors stored in said codebook, a plurality of shift 
amounts used to shift positions of the pulses, and gain 
code vectors stored in said gain codebook so as to output 
a combination of a code vector, shift amount, and gain 
code vector which minimizes distortion relative to input 
speech ; and 

a multiplexer section for outputting a combination of 
an output from said spectrum parameter calculation section, 
an output from said adaptive codebook section, and an 
output from said sound source quantization section. 



4. A speech coding apparatus including at least * 

a spectrum parameter calculation section for 
receiving a speech signal, obtaining a spectrum parameter, 
and quantizing the spectrum parameter, 

an adaptive codebook section for obtaining a delay 
and a gain from a past quantized sound source signal by 
using an adaptive codebook, and obtaining a residue by 
predicting a speech signal, and 

a sound source quantization section for quantizing a 
sound source signal of the speech signal by using the 




spectrum parameter and outputting the sound source signal, 
comprising: 

a discrimination section for discriminating a mode on 
the basis of a past quantized gain of a adaptive codebook; 

a sound source quantization section which has a 
codebook for representing a sound source signal by a 
combination of a plurality of non-zero pulses and 
collectively quantizing amplitudes or polarities of the 
pulses when an output from said discrimination section 
indicates a predetermined mode, and a gain codebook for 
quantizing gains, and outputs a combination of a code 
vector and gain code vector which minimizes distortion 
relative to input speech by generating positions of the 
pulses according to a predetermined rule; and 

a multiplexer section for outputting a combination of 
an output from said spectrum parameter calculation section, 
an output from said adaptive codebook section, and an 
output from said sound source quantization section. 

5. A speech decoding apparatus comprising: 

a demultiplexer section for receiving and 
demultiplexing a spectrum parameter, a delay of an 
adaptive codebook, a quantized gain, and quantized sound 
source information; 

a mode discrimination section for discriminating a 
mode by using a past quantized gain in said adaptive 
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codebook ; and 

a sound source signal reconstructing section for 
reconstructing a sound source signal by generating 
non-zero pulses from the quantized sound source 
5 information when an output from said discrimination 
section indicates a predetermined mode, 

wherein a speech signal is reproduced by passing the 
sound source signal through a synthesis filter section 
constituted by spectrum parameters* 
10 6. A speech coding/decoding apparatus comprising: ^ 

a speech coding apparatus including 

a spectrum parameter calculation section for 
receiving a speech signal , obtaining a spectrum parameter, 
and quantizing the spectrum parameter, 
15 an adaptive codebook section for obtaining a delay 

and a gain from a past quantized sound source signal by 
using an adaptive codebook, and obtaining a residue by 
predicting a speech signal, 

a sound source quantization section for quantizing a 
2 0 sound source signal of the speech signal by using the 
spectrum parameter and outputting the sound source signal, 

a discrimination section for discriminating a mode on 
the basis of a past quantized gain of a adaptive codebook, 
and 

2 5 a codebook for representing a sound source signal by 
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a combination of a plurality of non-zero pulses and 
collectively quantizing amplitudes or polarities of the 
pulses when an output from said discrimination section 
indicates a predetermined mode, 
5 said sound source quantization section searching 

combinations of code vectors stored in said codebook and a 
plurality of shift amounts used to shift positions of the 
pulses so as to output a combination of a code vector and 
shift amount which minimizes distortion relative to input 

10 speech, and further including 

a multiplexer section for outputting a combination of 
an output from said spectrum parameter calculation section, 
an output from said adaptive codebook section, and an 
output from said sound source quantization section; and 

15 a speech decoding apparatus including at least 

a demultiplexer section for receiving and 
demultiplexing a spectrum parameter, a delay of an 
adaptive codebook , a quantized gain, and quantized sound 
source information, 

2 0 a mode discrimination section for discriminating a 

mode by using a past quantized gain in said adaptive 
codebook , 

a sound source signal reconstructing section for 
reconstructing a sound source signal by generating 
25 non-zero pulses from the quantized sound source 
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information when an output from said discrimination 
section indicates a predetermined mode, and 

a synthesis filter section which is constituted by 
spectrum parameters and reproduces a speech signal by 
5 filtering the sound source signal. 

7. A speech coding/decoding apparatus comprising: 

a speech coding apparatus including 

a spectrum parameter calculation section for 
receiving a speech signal, obtaining a spectrum parameter, 
10 and quantizing the spectrum parameter, 

an adaptive codebook section for obtaining a delay 
and a gain from a past quantized sound source signal by 
using an adaptive codebook, and obtaining a residue by 
predicting a speech signal, 
15 a sound source quantization section for quantizing a 

sound source signal of the speech signal by using the 
spectrum parameter and outputting the sound source signal, 
a discrimination section for discriminating a mode on 
the basis of a past quantized gain of a adaptive codebook, 
2 0 and 

a codebook for representing a sound source signal by 
a combination of a plurality of non-zero pulses and 
collectively quantizing amplitudes or polarities of the 
pulses when an output from said discrimination section 
25 indicates a predetermined mode, 
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said sound source quantization section for outputting 
a combination of a code vector and shift amount which 
minimizes distortion relative to input speech by 
generating positions of the pulses according to a 
5 predetermined rule, and further including 

a multiplexer section for outputting a combination of 
an output from said spectrum parameter calculation section, 
an output from said adaptive codebook section, and an 
output from said sound source quantization section; and 
10 a speech decoding apparatus including at least 

a demultiplexer section for receiving and 
demultiplexing a spectrum parameter, a delay of an 
adaptive codebook, a quantized gain, and quantized sound 
source information , 
15 a mode discrimination section for discriminating a 

mode by using a past quantized gain in said adaptive 
codebook , 

a sound source signal reconstructing section for 
reconstructing a sound source signal by generating 
2 0 positions of pulses according to a predetermined rule and 
generating amplitudes or polarities for the pulses from a 
code vector when an output from said discrimination 
section indicates a predetermined mode, and 

a synthesis filter section which is constituted by 
2 5 spectrum parameters and reproduces a speech signal by 
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filtering the sound source signal. 

8. A speech coding apparatus comprising: 
a spectrum parameter calculation section for 

receiving a speech signal, obtaining a spectrum parameter, 
5 and quantizing the spectrum parameter; 

means for obtaining a delay and a gain from a past 

quantized sound source signal by using an adaptive 

codebook, and obtaining a residue by predicting a speech 

signal ; and 

10 mode discrimination means for receiving a past 

quantized adaptive codebook gain and performs mode 
discrimination associated with a voiced/unvoiced mode by 
comparing the gain with a predetermined threshold, and 
further comprising : 

15 sound source quantization means for quantizing a 

sound source signal of the speech signal by using the 
spectrum parameter and outputting the signal, and 
searching combinations of code vectors stored in a 
codebook for collectively quantizing amplitudes or 

2 0 polarities of a plurality of pulses in a predetermined 
mode and a plurality of shift amounts used to temporally 
shifting a predetermined pulse position so as to select a 
combination of an index of a code vector and a shift 
amount which minimizes distortion relative to input 

25 speech; 
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gain quantization means for quantizing a gain by 
using a gain codebook; and 

multiplex means for outputting a combination of 
outputs from said spectrum parameter calculation means, 
5 said adaptive codebook means, said sound source 
quantization means, and said gain quantization means. 

9 . An apparatus according to claim 8 , wherein said 
sound source quantization means uses a position generated 
according to a predetermined rule as a pulse position when 
10 mode discrimination indicates a predetermined mode. 

10 „ An apparatus according to claim 9 , wherein when 
mode discrimination indicates a predetermined mode, a 
predetermined number of pulse positions are generated by 
random number generating means and output to said sound 
15 source quantization means. 

11 . An apparatus according to claim 8 , wherein when 
mode discrimination indicates a predetermined mode, said 
sound source quantization means selects a plurality of 
combinations from combinations of all code vectors in said 
2 0 codebook and shift amounts for pulse positions in an order 
in which a predetermined distortion amount is minimized, 
and outputs the combinations to said gain quantization 
means , and 

said gain quantization means quantizes a plurality of 
2 5 sets of outputs from said sound source quantization means 



by using said gain codebook, and selects a combination of 
a shift amount, sound source code vector, and gain code 
vector which minimizes the predetermined distortion amount. 



ABSTRACT OF THE DISCLOSURE 

A speech coding apparatus includes a spectrum 
parameter calculation section , an adaptive codebook 
5 section, a sound source quantization section, a 
discrimination section, and a multiplexer section. The 
spectrum parameter calculation section receives a speech 
signal and quantizes a spectrum parameter. The adaptive 
codebook section obtains a delay and a gain from a past 

10 quantized sound source signal using an adaptive codebook, 
and obtains a residue by predicting a speech signal . The 
sound source quantization section quantizes a sound source 
signal using the spectrum parameter . The discrimination 
section discriminates the mode. The sound source 

15 quantization section has a codebook for representing a 
sound source signal by a combination of non-zero pulses 
and collectively quantizing amplitudes or polarities of 
the pulses in a predetermined mode, and searches 
combinations of code vectors and shift amounts used to 

2 0 shift the positions of the pulses to output a combination 
of a code vector and shift amount which minimizes 
distortion relative to input speech. The multiplexer 
section outputs a combination of outputs from the spectrum 
parameter calculation section, the adaptive codebook 

2 5 section, and the sound source quantization section. 
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